Mining Partially Periodic Event Patterns with Unknown Periods
نویسندگان
چکیده
We consider the problem of discovering partially periodic temporal associations, which we call p-patterns. In computer networks, a p-pattern might consist of ve repetitions every 30 seconds of a port-down event followed by a port-up event, which in turn is followed by a random gap until the next ve repetitions of these events. Finding p-patterns consists of two sub-tasks: (1) nding period lengths (e.g. 30 seconds) and (2) nding temporal association (e.g., port-down followed by port-up). While a variation of Apriori can be employed in the second sub-task, the rst sub-task has not been addressed (to the best of our knowledge). Moreover, how these two tasks are combined has signiicant implications. We develop an algorithm for nding periods using a chi-squared test, and we study the performance of this algorithm in the presence of noise. Further, we develop two algorithms for discovering p-patterns based on whether periods or temporal associations are discovered rst, and we examine trade-oos between these approaches. One result is that the association-rst algorithm has a higher tolerance to noise while the period-rst algorithm is more computation-ally eecient. From comparisons based on synthetic data, we observe that the association-rst algorithm is as eeective as the period-rst approach if the noise to signal ratio (NSR) is less than one. This motivates us to apply the period-rst algorithm to data from a production computer network. Numerous actionable patterns are discovered, including a possible security intrusion.
منابع مشابه
Discovering Partial Periodic Patterns in Discrete Data Sequences
The problem of partial periodic pattern mining in a discrete data sequence is to find subsequences that appear periodically and frequently in the data sequence. Two essential subproblems are the efficient mining of frequent patterns and the automatic discovery of periods that correspond to these patterns. Previous methods for this problem in event sequence databases assume that the periods are ...
متن کاملAn Adaptive Sensor Mining Model for Pervasive Computing Applications
In the past decade, the fields of pervasive computing, sensor design and data mining have begun to converge. As a result, there is a wealth of sensor data that can be analyzed with the goals of identifying patterns and automating sequences. Mining sequences of sensor events brings unique challenges to the KDD community. The challenge is heightened when the underlying data source is dynamic and ...
متن کاملMining Sequential Pattern With Synchronous And Asynchronous Periodic Time Stamp Using Hash Based Algorithm
Sequential Pattern Mining (SPM) is considered as an interesting data mining problem conducted in research works. In SPM, the time-series pattern can be used as a tool for identifying the behavior of the patterns while mining sequential pattern types of data. Mining time series patterns in temporal dataset acts as an imperative function in data mining and knowledge detection mechanisms. Several ...
متن کاملUsing Convolution to Mine Obscure Periodic Patterns in One Pass
The mining of periodic patterns in time series databases is an interesting data mining problem that can be envisioned as a tool for forecasting and predicting the future behavior of time series data. Existing periodic patterns mining algorithms either assume that the periodic rate (or simply the period) is user-specified, or try to detect potential values for the period in a separate phase. The...
متن کاملEfficient Mining of Partial Periodic Patterns in Time Series Database In ICDE 99
Partial periodicity search, i.e., search for partial periodic patterns in time-series databases, is an interesting data mining problem. Previous studies on periodicity search mainly consider finding full periodic patterns, where every point in time contributes (precisely or approximately) to the periodicity. However, partial periodicity is very common in practice since it is more likely that on...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001